Home Catalogue search

eng

Refine your search:
- Keyword
- Creator / Publisher
- Year
- Medium
- Type
- BLLDB-Access:
  - free (81)
  - subject to license (0)

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2 3 4 5

Hits 1 – 20 of 81

1	MAGIC DUST FOR CROSS-LINGUAL ADAPTATION OF MONOLINGUAL WAV2VEC-2.0
	Khurana, Sameer; Laurent, Antoine; Glass, James
	In: ICASSP 2022 ; https://hal.archives-ouvertes.fr/hal-03544515 ; ICASSP 2022, May 2022, Singapour, Singapore (2022)
	BASE
	Show details

2	Simple and Effective Unsupervised Speech Synthesis ...
	Liu, Alexander H.; Lai, Cheng-I Jeff; Hsu, Wei-Ning. - : arXiv, 2022
	BASE
	Show details

3	Learning Audio-Video Language Representations
	Rouditchenko, Andrew. - : Massachusetts Institute of Technology, 2021
	Abstract: Automatic speech recognition has seen recent advancements powered by machine learning, but it is still only available for a small fraction of the more than 7,000 languages spoken worldwide due to the reliance on manually annotated speech data. Unlabeled multi-modal data, such as videos, are now increasingly available in many different languages and provide opportunities to scale speech technologies. In this thesis, we introduce models and datasets for learning visually grounded spoken language from raw audio in videos. We propose a self-supervised audio-video model that learns from the English narration naturally present in instructional videos to relate spoken words and sounds to visual content. Our model can recognize spoken words and natural sounds in audio queries to retrieve relevant visual clips, supporting its application to video search directly using audio and spoken queries, without needing to transcribe speech to text. We further demonstrate that our model can learn multilingual audiovideo representations and can successfully perform retrieval on Japanese videos. Since our approach only requires audio-visual data without transcripts, we believe it is a promising direction to enable novel speech processing tools. ; M.Eng.
	URL: https://hdl.handle.net/1721.1/139024
	BASE
	Hide details

4	Cascaded Multilingual Audio-Visual Learning from Videos ...
	Rouditchenko, Andrew; Boggust, Angie; Harwath, David. - : arXiv, 2021
	BASE
	Show details

5	Magic dust for cross-lingual adaptation of monolingual wav2vec-2.0 ...
	Khurana, Sameer; Laurent, Antoine; Glass, James. - : arXiv, 2021
	BASE
	Show details

6	Text-Free Image-to-Speech Synthesis Using Learned Segmental Units ...
	The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing 2021; Glass, James; Harwath, David. - : Underline Science Inc., 2021
	BASE
	Show details

7	Exposure Bias versus Self-Recovery: Are Distortions Really Incremental for Autoregressive Text Generation? ...
	The 2021 Conference on Empirical Methods in Natural Language Processing 2021; Glass, James; He, Tianxing. - : Underline Science Inc., 2021
	BASE
	Show details

8	Mitigating Biases in Toxic Language Detection through Invariant Rationalization ...
	Chuang, Yung-Sung; Gao, Mingye; Luo, Hongyin. - : arXiv, 2021
	BASE
	Show details

9	Mitigating Biases in Toxic Language Detection through Invariant Rationalization ...
	The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing 2021; Chen, Yun-Nung; Chuang, Yung-Sung. - : Underline Science Inc., 2021
	BASE
	Show details

10	A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning
	Khurana, Sameer; Laurent, Antoine; Hsu, Wei-Ning...
	In: Interspeech 2020 ; https://hal.archives-ouvertes.fr/hal-02912029 ; Interspeech 2020, Oct 2020, Shanghai, China (2020)
	BASE
	Show details

11	Similarity Analysis of Contextual Word Representation Models ...
	Wu, John M.; Belinkov, Yonatan; Sajjad, Hassan. - : arXiv, 2020
	BASE
	Show details

12	CSTNet: Contrastive Speech Translation Network for Self-Supervised Speech Representation Learning ...
	Khurana, Sameer; Laurent, Antoine; Glass, James. - : arXiv, 2020
	BASE
	Show details

13	A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning ...
	Khurana, Sameer; Laurent, Antoine; Hsu, Wei-Ning. - : arXiv, 2020
	BASE
	Show details

14	What Was Written vs. Who Read It: News Media Profiling Using Text Analysis and Social Media Context ...
	Baly, Ramy; Karadzhov, Georgi; An, Jisun. - : arXiv, 2020
	BASE
	Show details

15	Vector-Quantized Autoregressive Predictive Coding ...
	Chung, Yu-An; Tang, Hao; Glass, James. - : arXiv, 2020
	BASE
	Show details

16	Non-Autoregressive Predictive Coding for Learning Speech Representations from Local Dependencies ...
	Liu, Alexander H.; Chung, Yu-An; Glass, James. - : arXiv, 2020
	BASE
	Show details

17	Improved Speech Representations with Multi-Target Autoregressive Predictive Coding ...
	Chung, Yu-An; Glass, James. - : arXiv, 2020
	BASE
	Show details

18	Classifying Alzheimer's Disease Using Audio and Text-Based Representations of Speech
	Haulcy, R'mani(R'mani Symon); Glass, James R
	In: Frontiers (2020)
	BASE
	Show details

19	Identification of digital voice biomarkers for cognitive health
	Lin, Honghuang; Karjadi, Cody; Ang, Ting F. A....
	In: Explor Med (2020)
	BASE
	Show details

20	On the Linguistic Representational Power of Neural Machine Translation Models
	Belinkov, Yonatan; Durrani, Nadir; Dalvi, Fahim...
	In: Computational Linguistics, Vol 46, Iss 1, Pp 1-52 (2020) (2020)
	BASE
	Show details

Page: 1 2 3 4 5

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern